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Abstract 

Background: The family Mesoniviridae (order Nidovirales) comprises of a group of positive-sense, single-stranded 
RNA ([+]ssRNA) viruses isolated from mosquitoes. 

Findings: Thirteen novel insect-specific virus isolates were obtained from mosquitoes collected in Indonesia, 
Thailand and the USA. By electron microscopy, the virions appeared as spherical particles with a diameter of 
-50 nm. Their 20,129 nt to 20,777 nt genomes consist of positive-sense, single-stranded RNA with a poly-A tail. 
Four isolates from Houston, Texas, and one isolate from Java, Indonesia, were identified as variants of the species 
Alphamesonivirus-1 which also includes Nam Dinh virus (NDiV) from Vietnam and Cavally virus (CavV) from Cote 
d'lvoire. The eight other isolates were identified as variants of three new mesoniviruses, based on genome 
organization and pairwise evolutionary distances: Karang Sari virus (KSaV) from Java, Bontag Baru virus (BBaV) from 
Java and Kalimantan, and Kamphaeng Phet virus (KPhV) from Thailand. In comparison with NDiV, the three new 
mesoniviruses each contained a long insertion (180 - 588 nt) of unknown function in the 5' region of ORFIa, which 
accounted for much of the difference in genome size. The insertions contained various short imperfect repeats and 
may have arisen by recombination or sequence duplication. 

Conclusions: In summary, based on their genome organizations and phylogenetic relationships, thirteen new 
viruses were identified as members of the family Mesoniviridae, order Nidovirales. Species demarcation criteria 
employed previously for mesoniviruses would place five of these isolates in the same species as NDiV and CavV 
(Alphamesonivirus-I) and the other eight isolates would represent three new mesonivirus species {Alphamesonivirus-5, 
Alphamesonivirus-6 and Alphamesonivirus-7). The observed spatiotemporal distribution over widespread geographic 
regions and broad species host range in mosquitoes suggests that mesoniviruses may be common in mosquito 
populations worldwide. 
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Background 

The recently established virus family Mesoniviridae (order 
Nidovirales) comprises a group of positive-sense, single- 
stranded RNA ([+]ssRNA) insect viruses [1]. To date, the 
six described mesoniviruses, Cavally (CavV), DakNong 
(DKNG), Hana (HanaV), Meno (MenoV), Nam Dinh 
(NDiV) and Nse (NseV) have all been isolated from 
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naturally infected mosquitoes collected in just two coun- 
tries: Cote d'lvoire (West Africa) and Vietnam (Southeast 
Asia) [2-5]. Although these mosquito-associated viruses 
do not appear to infect vertebrates or to cause illness in 
humans or livestock, they are nonetheless of interest be- 
cause of the structural and genetic similarities to other 
members of the order Nidovirales, namely viruses in the 
families Coronaviridae, Arteriviridae, and Roniviridae. 
Furthermore, the basal phylogenetic position of the 
mesoniviruses in relation to the Coronaviridae has led 
some authors to suggest that the coronaviruses, and 
possibly other viruses in the order Nidovirales, may have 
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evolved in arthropods [3-5]. Indeed, members of the 
Roniviridae naturally infect marine shrimp and can 
cause severe pathology in these economically important 
arthropods [6]. 

In this communication, we report the isolation and 
characterization of 13 additional mesoniviruses from mos- 
quitoes collected in Thailand, Indonesia and the United 
States of America (USA). These 13 viruses appear to 
represent four distinct species in the family Mesoniviridae, 
three of which are novel. Based on their wide geo- 
graphic distribution, the limited sampling that has been 
done to date for these mosquito-specific viruses, and 
their broad species host range in mosquitoes, it seems 
likely that mesoniviruses are common in mosquito pop- 
ulations worldwide. The potential biological significance 
and effect of mesoniviruses on mosquito vector compe- 
tence is also discussed. 

Results 

Virus morphology 

All isolates had similar ultrastructure. Mature virions 
~50 nm in diameter were located at the surface of 
infected C6/36 cells, either as individual particles or 
in groups (Figure 1B,D), similar to what has been 
reported recently [7]. They displayed a dense nucleo- 
capsid core ~40 nm in diameter and surrounding enve- 
lope (Figure 1A,B). Mature virions of the same size 
could also be found inside intracytoplasmic vacuoles 
(Figure 1A,B,D,E,F,H). Some infected cells had para- 
crystalline arrays consisting of empty and full virus 
particles but with less electron-density than mature 
virions (Figure 1C). At the periphery of these arrays, 
mature virions could be observed either free in the 
cytosol or inside vacuoles (Figure 1C). Moreover a 
recent report [8], suggests that the mesonivirus virion 
diameter of -40-50 nm observed by us and others [7] 
may be significantly lower than its actual size, a 
discrepancy attributed to the method of preparation 
for alternative imaging technologies. Employing cryo- 
electron tomography, Warrilow et al. [8] demonstrated 
on the surface of some virions the presence of spike- 
like projections displaying a globular head attached to 
the virion surface through a low density stalk, similar 
to what has been observed in other members of the 
order Ni do virales [9]. 

Genome sequence and organization 

The complete nucleotide sequences of all 13 isolates 
were determined by high-throughput Illumina sequen- 
cing with end-finishing by 5'- and 3'-RACE. Excluding 
the 3'-poly [A] tail, the (+) ssRNA genomes ranged in 
size from 20,127 nt (V3872) to 20,777 nt (JKT-10701). 
The organization of each genome was similar to that 
described previously for the mesoniviruses (NDiV, CavV, 




Figure 1 Ultrastructure of mesoniviruses in C6/36 cells. 

Bars= 100 nm. (A) virus particles of Nam Dinh strain V3872 inside a 
cytoplasmic vacuole; (B) virions of Bontag Baru JKT-9876 strain at 
the cell surface (thin arrow) and inside a cytoplasmic vacuole (thick 
arrow); (C) virions of Bontag Baru strain JKT-9876 free in the cytosol 
(thin arrows). Thick arrow indicates a portion of a paracrystalline 
agglomerate of empty virus particles in the host cell cytosol; (D) virions 
of Bontag Baru strain JKT-9853 at the surface of the cell (thin arrow) 
and inside a cytoplasmic vacuole (thick arrow); (E) virus particles of 
Karang Sari (JKT-10701) inside an expanded cistern of granular 
endoplasmic reticulum (thick arrow); (F) virus particles of Bontag 
Baru JKT-7774 inside a cytoplasmic vacuole; (G and H) virus particles 
of Kamphaeng Phet strain KP84-0156 inside cytoplasmic vacuoles 
(thick arrows) and free in cytosol (thin arrow in H). 



HanaV, NseV and MenoV), featuring a long 5'-untrans- 
lated region (5'-UTR) of 359 to 370 nt, six major long 
open reading frames (ORFs), and a long terminal region 
of 1780 to 1804 nt preceding the poly[A] tail (Figure 2). 
An alignment that also included the five previously de- 
scribed mesoniviruses revealed block insertions in three 
groups of isolates: i) KP84-0156, KP84-0192 and KP84- 
0344 had a 180 nt insertion, ii) JKT-9853, JKT-9876, 
JKT-9891 and JKT-7774 had a 573 nt insertion, and iii) 
JKT-10701 had a 588 nt insertion. These insertions 
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Figure 2 Genome organisation of the mesoniviruses. In ORF1a and ORF1b the putative ribosomal frame-shift (RFS) element and the locations 
of conserved functional domains, including the 3C-like serine protease (3CLP), RNA-dependent RNA polymerase (RdRp), zinc-binding (Z), helicase 
(Hel), exoribonuclease (ExoN), N7-methyltransferase (NMT) and 2'-0-methyltransferase (OMT) domains, are shown. Sequence insertions in ORF1a 
are shown as black blocks. In ORF2a encoding the S glycoprotein the sites of proteolytic cleavage to generate SI, S2 and the uncharacterised 
N-terminal fragment are shown as black inverted triangles. Putative ORF4 that is relatively conserved in the 3'terminal region of the genome is 
shown as black rectangles and other small ORFs >180 nt that occur variously in the 3'-terminal domains and elsewhere in the genome are shown 
as grey rectangles. 



accounted for most of the observed difference in gen- 
ome size in the mesoniviruses (Figure 2). Excluding this 
region, which commenced -1300 nt from the 5'- 
terminus, the mesoniviruses shared global nucleotide 
sequence identity of ~43.7%. Maximum pairwise diver- 
gence was between MenoV and all other viruses (65.5% 
- 67.6% identity). 

Phylogenetic analysis and species determination 

To determine the phylogenetic relationships of the newly 
identified insect viruses, maximum likelihood (ML) phylo- 
genetic trees were constructed based on the amino acid 
alignments of ORF2a (unprocessed S protein) and a 
concatenated region of the highly conserved domains 



within ORFlab (3CL pro , RdRp and ZnHell). The phy- 
logenies exhibited highly similar topologies and strongly 
indicated that the viruses identified in this study cluster 
within the previously identified insect-specific nido- 
viruses in the genus Mesonivirus (NDiV, CavV, HanaV, 
NseVand MenoV) (Figure 3). 

A multiple sequence alignment of the 13 new genomes 
indicated high identity between multiple isolates, which 
was also confirmed by our phylogenetic analysis. Isolates 
KP84-0156, KP84-0192 and KP84-0344 differed only 
by ten nucleotide substitutions and a single 6-nt indel 
and formed a well-supported monophyletic clade in 
both phylogenies (Figure 3). These should be consid- 
ered isolates of the same virus (designated Kamphang 
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Figure 3 Phylogenetic analysis of the mesonivirus proteins. (A). Maximum-likelihood phylogeny of the full-length proteins encoded in ORF2a 
(unprocessed S proteins). (B). Maximum-likelihood analysis of conserved protein domains of ORFlab (3CL pm , RdRp, HEL1). Scale bars indicate 
amino acid substitutions/site. 



Phet virus - KPhV). Likewise, JKT-9853, JKT-9876 and 
JKT-9891 differed by only single nucleotide substitu- 
tions and JKT-7774 differed by 68 nucleotide substitu- 
tions and two indels of 3 and 6 nucleotides, and these 
four viruses formed a strongly supported monophyletic 
clade in the ML trees, suggesting that they can be con- 
sidered isolates of the same virus (designated Botang 
Baru virus - BBaV). Isolates 16740, 16757, V3872 and 
V3982 were also closely related, differing by only 18-57 
nucleotide substitutions (~0.1-0.3%) across the entire 
genome. The clade formed by these isolates in each ML 
tree clustered within a closely related monophyletic 



group that also included NDiV and isolate JKT-9982 
(Figure 3). These can all be considered strains of NDiV. 
Finally, isolate JKT- 10701 was clearly distinct from all 
other mesoniviruses in our phylogenetic analysis, and 
was considered a unique virus, Karang Sari (KSaV). 

It has recently been suggested that pairwise evolutionary 
distances (PED) can be used as a species demarcation 
criterion in the genus Mesonivirus [6]. We calculated 
the PED between the 13 new isolates described in this 
study and those of five previously described mesoni- 
viruses (CavV, HanaV, NDiV, NseV and MenoV) using 
the conserved protein domains of ORFlab (3CL pro , 
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RdRp and ZnHell). Based on this analysis, NDiV (in- 
cluding the Ngewontan and Houston strains) and CavV 
can be considered a single species, which has previ- 
ously been designated at Alphamesonivirus- 1 . The PED 
between each of these strains was substantially less than 
the suggested cutoff of 0.032 for species demarcation [6]. 
By this method, each of the other previously described 
mesoniviruses - HanaV MenoV, and NseV - would also be 
considered unique species, as the PED between these 
viruses and all other mesoniviruses were considerably 
greater than 0.032. These viruses could be assigned to 
the species Alphamesonivirus-2, Alphamesonivirus-3 
and Alphamesonivirus-4, respectively. Similarly, novel 
viruses BBaV, KSaV and KPhV would each also be 
assigned to separate mesonovirus species: Alphamesoni- 
virus-5, Alphamesonivirus-6 and Alphamesonivirus-7, 
respectively. Strikingly, the vast majority of the genetic 
diversity that was measured within the genus Mesoni- 
virus was present at the inter-species level (Additional 
file 1: Figure SI). This pattern strongly supports the 
assignment of the current diversity of mesoniviruses 
into seven distinct species. 

Open reading frames 

In mesoniviruses, ORFla features a 3C-like protease 
(3CLpro) domain and flanking transmembrane domains, 
which are highly conserved amongst nidoviruses; these 
domains were identified in each of the new viruses. 
However, the block nucleotide sequence insertions 
resulted in a highly variable region near the N-terminus 
of ORFla with insertions of various lengths ranging 
from 13 aa (MenV) to 196 aa (KSaV) (Additional file 2: 
Figure S2). The inserted sequences (which apparently 
are unique to the mesoniviruses) featured variations on 
a common sequence (SKRKGK) at the terminal points. 
There were also various short stretches of this sequence 
and other imperfectly repeated sequences within each 
insertion region suggesting possible origins through 
recombination or sequence duplication events. 

As has been reported previously for other mesoni- 
viruses, ORFla is overlapped by ORFlb which contains 
highly conserved regions associated with the replicase 
complex including RNA-dependent RNA polymerase 
(RdRp), multinucleate zinc-binding module and associated 
helicase (ZnHell), exoribonuclease (ExoN), N 7 -methyl- 
transferase (NMT) and 2'-0-methyltransferase (OMT) 
domains. The ORFla/ORFlb overlap includes a puta- 
tive 'slippery' sequence (GGAUUUU), allowing ORFlb 
expression as a pplab polyprotein through a -1 riboso- 
mal frame shift. 

ORF2a encodes the S glycoprotein. In CavV, the S pro- 
tein (p77) has been shown to be generated from the full- 
length ORF2a polyprotein by internal signalase cleavage 
at the site 2 2oNAHC|STRID and is further processed to 



generate cleavage products SI (p23) and S2 (p57), each 
of which is a structural component of virions [5]. A 
MUSCLE alignment of all available mesonivirus ORF2a 
polyprotein sequences (Additional file 3: Figure S3) indi- 
cated that the signalase cleavage site is relatively variable 
conforming only to the sequence motif [C/S] | [L/A/S] 
TRIDL and that the S1-S2 cleavage site (R|WDSSYV) is 
highly conserved. The S protein, a class I transmem- 
brane glycoprotein with a C-terminal transmembrane 
domain, features a set of 12 conserved cysteine residues 
which are likely to form 6 disulphide bridges, and 7 to 
11 potential N-glycosylation sites, only 4 of which are 
conserved across all mesoniviruses. Surprisingly, an N- 
terminal signal peptide is strongly predicted for the 
MenoV S protein (between amino acids A 20 and S 21 ) but 
not for those of any of the other mesoniviruses (http:// 
www.cbs.dtu.dk/services/SignalP). There is a relatively 
high level of amino acid sequence conservation in the SI 
and S2 proteins (47.2% and 51.7% global identify, re- 
spectively). However, the N- terminal product of signa- 
lase cleavage of the ORF2a polyprotein displays very low 
sequence conservation amongst the mesoniviruses (7.4% 
global identity). This product, which is predicted to be 
positioned on the inside of the ER membrane (TMHMM 
server; www.cbs.dtu.dk/services/TMHMM-2.0) has not 
yet been identified in virions or infected cells. 

In each of the mesoniviruses, the precise region of the 
genome encoding the highly variable N-terminal signa- 
lase cleavage product of the ORF2a polyprotein also 
contains an alternative open reading frame (ORF2b) 
which has been shown in CavV to encode the putative 
nucleoprotein p25 [5]. Our analysis indicated that the 
predicted molecular weights of the unmodified mesoni- 
virus N proteins range from 23.69 kDa (NgeV) to 
25.38 kDa (KSaV) and all are highly basic (pi >10). They 
display a moderate level of overall sequence conserva- 
tion (34.1% global identity) due primarily to two highly 
conserved domains corresponding in KSaV to G 95 to 
A 176 (64.3% global sequence identity) and L 195 to F 210 
(75.0% global sequence identity). 

The third coding region, commencing immediately 
downstream of ORF2a, contains two overlapping long 
open reading frames (ORF3a and ORF3b) each encoding 
small hydrophobic proteins (Figure 4). A Clustal X align- 
ment of the mesonivirus ORF3a proteins and individual 
structural analyses using SignalP and TMHMM and 
NetNGlyc (www.expasy.org) indicated that each is a 
class I transmembrane glycoprotein with a predicted 
N-termimal signal peptide, an ectodomain containing a 
conserved set of 6 cysteine residues and a single con- 
served N-glycosylation site, a transmembrane domain 
and a C-terminal cytoplasmic domain (Figure 4A, 4D). 
The cysteine-rich region of the ectodomain displays mod- 
erately high sequence conservation including a stretch of 
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Figure 4 Polypeptides encoded in the ORF3a/ORF3b region of mesonivirus genomes. (A). Clustal X alignment of the sequences of 
polypeptides encoded in 0RF3a illustrating the predicted signal peptide, exodomain, transmembrane domain and cytoplasmic domain, two 
domains of highly conserved sequence (boxed), conserved cysteine residues (black shading) and a potential N-glycosylation site (underlined). 
(B). Clustal X alignment of the sequences of polypeptides encoded in 0RF3b illustrating the cytoplasmic domain, transmembrane domain and 
ectodomain and a potential N-glycosylation site (underlined). (C). Clustal X alignment of the sequences of polypeptides that would be generated 
by -1 ribosomal frame-shift at the predicted 'slippery' sequence (CACUUUU) in the overlap region. (D). Schematic representation of the predicted 
membrane topology of the putative 3a, 3b and 3ab glycoproteins using SignalP and TMHMM and NetNGIyc (www.expasy.org). 



13 totally conserved amino acids adjacent to the trans- 
membrane domain that is unusually rich in large aro- 
matic residues (Y, W, F). The cytoplasmic domain is 
highly variable in sequence, except for the completely 
conserved motif (H/S)YIPLLPR. No similar motif has 
been detected in a search of available eukaryote se- 
quences. The ORF3b proteins are each predicted to be 
class II transmembrane proteins with an N-terminal cyto- 
plasmic domain, a transmembrane domain and a short C- 
terminal ectodomain (Figure 4B, 4D). A single N- 
glycosylation site was predicted in the ectodomain of all 



the mesoniviruses, except NseV. The ORF3b proteins 
display poor overall sequence conservation and have 
few features that suggest a biological function. 

The long 3'-terminal region of all mesoniviruses, 
except MenoV, contained a small open reading frame, 
which was previously designated as ORF4. A Clustal x 
multiple sequence alignment of the putative proteins 
indicated that, although they varied in size (45 aa to 61 aa), 
there was a remarkable level of sequence identity, particu- 
larly in the N-terminal portion of the protein (Additional 
file 4: Figure S4). However, there are no obvious structural 
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characteristics that suggest a function and it is not 
known if these proteins are expressed in infected cells. 

Various other short ORFs (<150 nt) were detected in 
the 3'-terminal regions of each mesonivirus but none 
was obviously conserved. Alternative ORFs of >180 nt (i.e., 
60 amino acids) were also detected within other ORFs, 
including in ORF2a in the original strain of NDiV (72 aa) 
and the Ngewontan (72 aa) and Houston (61 aa) strains, 
in which the 5' region is missing (Figure 2). These ORFs 
share a relatively high degree of nucleotide sequence 
conservation (92%) but a lower level of amino acid 
sequence conservation (79%) and no obvious structural 
characteristics indicate the possible function of the 
encoded proteins. Another short ORF that displayed 
some degree of sequence conservation was found in 
the 3' region of ORFla for all viruses except MenoV 
and KPhV (Figure 2). 

Transcription regulatory sequences 

Mesoniviruses have been shown previously to express a 
3'-nested set of polyadenylated sub-genomic mRNAs 
produced by copy-choice mediated leader-body fusion 
at sites located immediately upstream of alternative 
transcription regulatory sequence (TRS) elements which 
occur in the 5'-UTR and in the regions preceding ORF2a/ 
ORF2b and ORF3a/ORF3b [5]. Each of the previously 
identified leader-body fusion sites and TRS elements 
[AUxxUACUACUACUA and AGAx(x)ACUCUCCCA] 
were completely conserved across all new mesoni- 
viruses examined in this study. 

Ribosomal frame-shift sites 

In nidoviruses, translation of ORFlb characteristically 
occurs through a ribosomal frame-shift at a 'slippery' 
sequence in the ORFla/ORFlb overlap region to allow 
read-through synthesis of a polyprotein (pplab). This 
is usually facilitated by an RNA pseudoknot in the 
sequence immediately downstream of the frame-shift site. 
However, a previous analysis of the NDiV sequence failed 
to identify a predicted pseudoknot structure, suggesting 
that a stem-loop structure (predicted using pknotsRG; 
http://bibiserv.techfak.uni-bielefeld.de/pknotsrg/) in the 
same region of the genome may facilitate the frame-shift 
[3]. However, our analysis using pknotsRG of the 13 new 
mesonivirus sequences and those of CavV, HanaV, NDiV, 
NseV and MenoV indicated that, although the slippery 
sequence (GGAUUUU) is completely conserved, the 
stem-loop structure predicted for NDiV is not predicted 
as the minimum free energy structure for any of the 
other viruses. Indeed, an alignment of the corresponding 
region of the genome of all mesoniviruses (Additional 
file 5: Figure S5) indicated that there was poor con- 
servation of this sequence and few compensatory mu- 
tations that would preserve the stem-loop structure 



(Figure 5a). As functional pseudoknot structures are 
usually conserved, an alignment of all mesonivirus 
sequences extending 150 nt from the start of the 'slippery 
sequence' was analyzed using the IPknot server (http:// 
rna.naist.jp/ipknot) which predicts the consensus sec- 
ondary structure from a multiple sequence alignment 
(by using integer programming to select the maximum 
expected accuracy (MEA) structure) [10]. As shown in 
Figure 5b, the IPknot algorithm predicted a conserved 
pseudoknot structure, which conformed to all nucleo- 
tide substitutions in the available mesoniviruses. 

Analysis of the region of the ORF3a/ORF3b overlap 
indicated that it has a similar format to that of the 
ORFla/ORFlb overlap region, featuring a long overlap 
region and potential slippery sequence (CACUUUU) that 
could result in read-through by a -1 ribosomal frame- 
shift. Structural analyses using SignalP and TMHMM and 
NetNGlyc (www.expasy.org) indicated that a frame-shift 
at this site would result in a double-membrane-spanning 
glycoprotein (p3ab) of approximately 30-33 kDa (de- 
pending on whether one or two N-glysosylation sites are 
occupied) with both the N-terminal and C-terminal 
domains located in the extracellular lumen (Figure 4c, d). 
A previous analysis of CavV proteins by mass spectrom- 
etry identified peptide sequences from proteins migrating 
in gels at 17, 18 19 and 20 kDa that corresponded to the 
3a protein (designated the M protein) and these were 
considered to be variously glycosylated forms; no pro- 
tein corresponding to the ORF3b gene product was 
detected [5]. As for the ORFla/ORFlb region, analysis 
of the genome region downstream of the putative riboso- 
mal frame-shift element (RFS) element using pknotsRG 
identified various potential stem-loops but no commonly 
predicted minimum free energy structure. Analysis of a 
multiple sequence alignment of the region using the 
IPknot server also failed to predict a convincing pseudo- 
knot structure. 

Discussion and conclusions 

In this study we have identified 13 mesonivirus isolates 
and characterized their genome organization and phylo- 
genetic relationships. Based on species demarcation 
criteria employed previously for mesoniviruses [1], five 
of these new isolates would be assigned to the same 
species as NDiV and CavV (Alphamesonivirus-1), other 
previously described mesoniviruses - HanaV MenoV, and 
NseV - would be assigned to three new species {Alphame- 
sonivirus-2, Alphamesonivirus-3 and Alphamesonivirus-4, 
respectively), and eight of the new isolates would repre- 
sent three new species (Alphamesonivirus-5, Alpha 
mesonivirus-6 and Alphamesonivirus-7) [6]. However, 
we consider this basis for species demarcation, which 
employs only a genetic standard of pairwise sequence 
divergence to assign viruses to a species, should be 
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Figure 5 Predicted structures of the mesonivirus ORF1a/ORF1b ribosomal frame-shift element (RFS) element. (A) Stem-loop structure 
predicted previously for NDiV [3] using pseudoknotsRG software, illustrating nucleotides that are substituted in various viruses (red circles). Nucleotide 
substitutions (which are primarily non-compensatory) are shown in the boxes. The corresponding sequence alignment is shown in Additional file 4: 
Figure S4. (B). An alternative conserved pseudoknot structure predicted from the multiple sequence alignment using ipknot software. The putative 
'slippery' sequence (GGAUUUU) at the ribosomal frame-shift site is shaded in grey. 



re-evaluated following further assessment of the ecol- 
ogy of these viruses and their potential for genetic 
recombination which may provide a more informed 
analysis of suitable species demarcation criteria. 

The isolates of viruses assigned to the species Alpha- 
mesonivirus-1 illustrate the wide geographic distribution 
and mosquito host range of some mesoniviruses. The 
original four isolates of NDiV were made in northern 
and central Vietnam from Culex vishnui and Cx. tritae- 
niorhynchus mosquitoes collected indoors during a sur- 
veillance program for Japanese encephalitis virus (JEV) 
[3]. Our NDiV isolate from Java, Indonesia (Ngewontan 
strain) was also obtained from a pool of Cx. vishnui 
mosquitoes in 1981. The four NDiV isolates from Houston, 
Texas (Houston strain) were made from Cx. quinquefas- 
ciatus and Aedes albopictus collected outdoors within 
the Houston metropolitan area during West Nile virus 
(WNV) surveillance in 2004 and 2010, respectively. 
Interestingly, all of these isolates were from mosquitoes 
captured in or near human dwellings and from species 



that feed on humans. The close similarity of isolates 
from Houston and Vietnam may suggest a recent trans- 
location, possibly during the Vietnam conflict when 
Houston hosted a major air base for embarkation/dis- 
embarkation. Isolates of Kampaeng Phet, Botang Baru 
and Karang Sari viruses, which may be considered three 
new mesonivirus species, have a geographic distribution 
extending at least from central Thailand to Kalimantan 
and Java in Indonesia, and have been isolated from at 
least two species of Culex mosquitoes. 

Another consideration is the potential effect of meso- 
nivirus infection on the susceptibility and vector compe- 
tence of mosquitoes for viral pathogens of vertebrates. 
For example, both Cx. vishnui and Cx. tritaeniorhynchus 
are important vectors of JEV in Asia, and Cx. quinque- 
fasciatus is the major vector of WNV in Houston. 
Recent experimental studies with Ae. aegypti mos- 
quitoes infected with certain strains of the Wolbachia 
indicate that the presence of the symbiont bacterium 
interferes with dengue virus replication and decreases 
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vector competence, possibly by upregulating or priming the 
mosquito's innate immune system [11,12]. Similar results 
have been reported for Wolbachia-infected Ae. aegypti 
and chikungunya virus [11] and with Wolbachia-in- 
fected Cx. quinquefaciatus and WNV [13]. If a bacterial 
endosymboint can alter a mosquito's vector competence 
for arboviruses, it seems plausible that a viral symbiont 
could have a similar effect [14]. This is an important 
area for future investigation. 

Due to the continuous efforts of the virus discovery 
program of World Reference Center for Emerging Viruses 
and Arboviruses (WRCEVA), we have continued to isolate 
mesoniviruses from various insect vectors collected from 
widespread geographic locations (e.g., Nepal, Colombia 
and South Florida) suggesting that these viruses are 
more common than previously thought. Zirkel et al. [4] 
suggested that these viruses may have their origins in 
pristine rainforests and emergence may have been faci- 
litated through anthropogenic-induced modifications 
(e.g., altered land use, deforestation). 

The detailed analysis and comparison of mesonivirus 
genome architecture conducted here has revealed some 
unexpected characteristics. Firstly, the presence of block 
insertions of up to 588 nt in the 5' terminal quadrant 
of ORFla of several mesoniviruses has not been reported 
previously. The function of this region is presently 
unknown in mesoniviruses and other nidoviruses and 
so the structural and functional consequences of these 
insertions, which contain various imperfect repeats is 
unclear. Although sharing similar genome architecture, 
nidoviruses vary greatly in genome size. A previous ana- 
lysis of the evolution of nidovirus genomes concluded that 
genome expansion has occurred in a wave-like fashion in 
which the three major coding regions (ORFlb, ORFla and 
the 3'ORFs) expanded consecutively in a hierarchy that 
reflects the roles of their encoded proteins in the virus 
replication cycle [15]. This implies that nidoviruses have 
an inherent capacity for genome expansion, most likely 
associated with the transitional retention of sequences that 
serve as a resource for the evolution of new functions. 
The block insertions detected in ORFla appear to be func- 
tionally redundant and their potential role in such evolu- 
tionary processes is presently unclear. 

The comparative analysis also revealed that the stem- 
loop structure which had previously been identified at 
the RFS site of NDiV is not conserved in the other 
mesoniviruses and so may not be responsible for acti- 
vating the -1 ribosomal frame shift. Secondary struc- 
ture predictions using IPknot on the aligned sequences 
downstream of the conserved 'slippery' sequence site 
revealed a conserved pseudoknot structure that con- 
formed to all nucleotide substitutions. However, the 
predicted structure featured only four relatively short 
regions of complementarity and no estimations of 



minimum free energy for the represented structures are 
available through this algorithm. A possible 'slippery' 
sequence (CACUUUU) was also detected in the ORF3a/ 
ORF3b overlap region but no conserved stem-loop or pseu- 
doknot structure was predicted by IPknot in the down- 
stream sequence. It is, therefore, unclear whether ORF3b is 
expressed by internal initiation or as a read-through exten- 
sion of ORF3a, which would generate a double-membrane 
spanning protein. Functional analysis of each of the regions 
corresponding to the RFS in ORFla/ORFlb and the puta- 
tive RFS in ORF3a/ORF3b would help resolve the mecha- 
nisms of mesonivirus gene expression. 

In conclusion, we have identified and characterized 
several new mesoniviruses from mosquitoes of human 
medical importance sampled over time from wide- 
spread geographic regions. Several important questions 
related to their transmission, maintenance in insect 
hosts in nature, their potential impact of infection on 
the insect's behavior, fertility, fecundity and survival, 
their evolution, their mechanisms of gene expression 
and their potential to be developed as biological control 
agents, warrant further investigation. 

Methods 

Cells and viruses 

All viruses used in this study were obtained from the 
WRCEVA at the University of Texas Medical Branch. 
Some were isolated by the authors (RBT and HG), dur- 
ing arbovirus field studies; the remainder were isolated 
by other investigators and sent to the WRCEVA for 
identification and further characterization. All isolations 
were originally made in mosquito cell cultures (C6/36 or 
AP-61). The proposed names, original sources and geo- 
graphic origins and GenBank Accession numbers of the 
sequences obtained for the 13 viruses included in our 
study are listed below and in Table 1. 

JKT-10701 was isolated from a pool of Culex vishnui 
mosquitoes collected on 11/26/1981 at Karang Sari, 
Cilacap (Central Java) Indonesia. The initial isolation 
was made at the Naval Medical Research Unit #2 
(NAMRU-2) in Jakarta. 

JKT-7774 was isolated at NAMRU-2 from a pool of 50 
Culex vishnui collected at Bontag Baru, East Kalimantan, 
Indonesia in February 1981. Strains JKT-9876, JKT-9891 
and JKT-9853 were also isolated at NAMRU-2 appear to 
be almost identical to isolate JKT-7774 in the phylogen- 
etic tree. Since JKT-7774 was the first virus in this group 
to be isolated, it should be the prototype. 

JKT-9876 was isolated at NAMRU-2 from a pool of 
Culex vishnui collected at Cilacap, Central Java, Indonesia 
on 10/13/1981. 

JKT-9891 was isolated at NAMRU-2 from a pool of 
Culex vishnui mosquitoes collected on 10/14/1981 at 
Karang Sari, Cilacap (Central Java), Indonesia. 
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Table 1 Origin of the 13 Mesoniviruses viruses described in this study 

Proposed name Strain Proposed virus GenBank Host species Collection locality Collection date 

identification designation species accession no. 



Ngewontan 


JKT-9982 


Alphamesonivirus- 


-1 


KC807169 


Houston 


V3872 


Alphamesonivirus- 


-1 


KC807175 


Houston 


V3982 


Alphamesonivirus- 


-1 


KC807176 


Houston 


16740 


Alphamesonivirus- 


-1 


KC807177 


Houston 


16757 


Alphamesonivirus- 


-1 


KC807178 


Bontag Baru 


JKT-7774 


Alphamesonivirus- 


-5 


KC807173 


Bontag Baru 


JKT 9891 


Alphamesonivirus- 


-5 


KC807167 


Bontag Baru 


JKT-9853 


Alphamesonivirus- 


-5 


KC807168 


Bontag Baru 


JKT-9876 


Alphamesonivirus- 


-5 


KC807166 


Karang Sari 


JKT-10701 


Alphamesonivirus- 


-6 


KC807171 


Kamphaeng Phet 


KP 84-0192 


Alphamesonivirus- 


-7 


KC807174 


Kamphaeng Phet 


KP 84-0344 


Alphamesonivirus- 


-/ 


KC807172 


Kamphaeng Phet 


KP 84-01 56 


Alphamesonivirus- 


-/ 


KC807170 



a Species of the mosquito pool was not provided. 



JKT-9853 was isolated at NAMRU-2 from a pool of 
Culex tritaeniorhynchus collected at Cilacap, Central 
Java, Indonesia on 10/6/1981. 

KP84-0192, KP84-0344 and KP84- 0156 were isolated 
from pools of mosquitoes (species not given) collected 
at Kamphaeng Phet, Thailand in 1984. Original isola- 
tions were made at the Armed Forces Research Institute 
for Medical Sciences, Bangkok. 

JKT-9982 was isolated at NAMRU-2 from a pool 
of Culex vishnui mosquitoes collected at Ngewotan, 
Central Java, Indonesia on 11/17/1981. 

V3872 and V3982 - Both viruses were isolated at UTMB 
from pools of Culex quinquefasciatus mosquitoes collected 
in the summer of 2004 as part of an arbovirus surveillance 
program in Houston (Harris County), Texas, USA. 

TVP16740 and TVP16757 - These two viruses were 
also isolated at UTMB from pools of Aedes albopictus 
mosquitoes collected in June 2010 as part of the Harris 
County arbovirus surveillance program in Houston, Texas. 

Before sequencing, all virus stocks were grown in 
cultures of the C6/36 clone of Ae. albopictus cells [16], 
obtained from the American Type Culture Collection 
(ATCC), Manassas, VA. Infection was characterized by 
detachment of cells and cell lysis. 

Nucleotide sequence accession numbers 

The following mesonivirus genome sequences were 
determined in this study: Karang Sari virus, strain JKT- 
10701 (KC807171); Bontang virus, strain JKT-9891 
(KC807169); strain JKT-9853 (KC807167); strain JKT- 
9876 (KC807168); strain JKT-7774 (KC807166); Kam- 
phang Phet virus, strain KP 84-0192 (KC807173); strain 
KP 84-0344 (KC807174); strain KP 84-0156 (KC807172); 



Culex vishnui 


Ngewontan, Java, Indonesia 


1981 


Culex quinquefasciatus 


Houston, Texas, USA 


2004 


Culex quinquefasciatus 


Houston, Texas, USA 


2004 


Aedes albopictus 


Houston, Texas, USA 


2010 


Aedes albopictus 


Houston, Texas, USA 


2010 


Culex vishnui 


Bontag Baru, Kalmantan, 
Indonesia 


1981 


Culex vishnui 


Karang Sari, Java, Indonesia 


1981 


Culex tritaeniorhynchus 


Cilacap, Java, Indonesia 


1981 


Culex vishnui 


Cilacap, Java, Indonesia 


1981 


Culex vishnui 


Karang Sari, Java, Indonesia 


1981 


Mosquito pool 3 


Kamphaeng Phet, Thailand 


1985 


Mosquito pool 3 


Kamphaeng Phet, Thailand 


1984 


Mosquito pool 3 


Kamphaeng Phet, Thailand 


1984 



NDiV Ngewotan strain JKT-9982 (KC807170); NDiV 
Houston strain V3872 (KC807175), V3982 (KC807176), 
16740 (KC807177), and 16757 (KC807178). 

Transmission electron microscopy 

For ultrastructural analysis in ultrathin sections infected 
cells were fixed for at least 1 hr in a mixture of 2.5% 
formaldehyde prepared from paraformaldehyde powder, 
and 0.1% glutaraldehyde in 0.05 M cacodylate buffer 
pH 7.3 to which 0.03% picric acid and 0.03% CaCi2 were 
added. The monolayers were washed in 0.1 M cacodylate 
buffer, cells were scraped off and processed further as a 
pellet. The pellets were post-fixed in 1% OsC^ in 0.1 M 
cacodylate buffer pH 7.3 for 1 h, washed with distilled 
water and en bloc stained with 2% aqueous uranyl acetate 
for 20 min at 60°C. The pellets were dehydrated in etha- 
nol, processed through propylene oxide and embedded in 
Poly/Bed 812 (Polysciences, Warrington, PA). Ultrathin 
sections were cut on Leica EM UC7 ultramicrotome 
(Leica Microsystems, Buffalo Grove, IL), stained with lead 
citrate and examined in a Philips 201 transmission elec- 
tron microscope at 60 kV. 

Next generation sequencing 
Library construction 

Viral RNA (0.05-1.7 ug) was fragmented by incubation at 
94°C for 8 min in 19.5 ul of fragmentation buffer (Illumina 
15016648). First and second strand synthesis, adapter 
ligation and amplification of the library were performed 
using the Illumina TruSeq RNA Samplec Preparation 
kit under conditions prescribed by the manufacturer 
(Illumina). Samples were tracked using the "index tags" in- 
corporated into the adapters as denned by the manufacturer. 
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Sequence analysis 

Cluster formation of the library DNA templates was per- 
formed using the TruSeq PE Cluster Kit v3 (Illumina) 
and the Illumina cBot workstation using conditions 
recommended by the manufacturer. Paired end 50 base 
sequencing by synthesis was performed using TruSeq 
SBS kit v3 (Illumina) on an Illumina HiSeq 1000 using 
protocols defined by the manufacturer. Cluster density 
per lane was 645-980 k/mm 2 and post filter reads 
ranged from 148-178 million per lane. Base call conver- 
sion to sequence reads wasperformed using CASAVA- 
1.8.2. Virus sequences were edited and assembled using 
the SeqMan and NextGen modules of the DNAStar 
Lasergene 7 program (Bioinformatics Pioneer DNAStar, 
Inc., Madison, WI). In certain cases, prefiltering of reads 
to remove host sequence enhanced the assembly process. 
Assembly was carried out using a fasta file of Aedes albo- 
pictus sequences to remove host DNA from the assembly 
thus reducing the number of contigs present. 

Genomic analysis 

The compiled sequences had their relationship to other 
viruses determined by a BLASTX search. The open 
reading frames were determined using Enzyme X 
(Nucleobytes, Inc., Aalsmeer). Relationships to the other 
two insect-specific nidoviruses were determined using 
MacVector (Cary, NC) using their DNA identity matrix 
software. The presence of conserved protein domains 
was determined using the SMART webserver [17,18]. 

Phylogenetic analysis 

A region corresponding to the full-length product 
encoded by ORF2a (unprocessed S proteins) was used to 
determine the relationships within the family Mesoniviri- 
dae along with a concatenated region of the highly 
conserved domains within ORFlab (3CL pro , RdRp and 
ZnHell). ORF2ab alignments were determined using the 
MUSCLE algorithm [19] as amino acids before being 
toggled back to the nucleotides while maintaining the 
alignment. Concatenated ORFlab alignments were per- 
formed at the protein level. A maximum likelihood (ML) 
tree for ORF2a was constructed in MEGA 5.2 using the 
Jones-Taylor-Thornton substitution model of nucleotide 
substitution, uniform substitution rates among sites and 
the Nearest Neighbor-Interchange heuristic method and a 
very strong branch swap filter. An optimal ML tree was 
then estimated using the appropriate model and a heur- 
istic search with tree-bisection-reconstruction branch 
swapping and 10 replicates, estimating variable param- 
eters from the data, where necessary. A maximum like- 
lihood phylogeny of the conserved domains of ORFlab 
was constructed using PhyML 3.0 [20], the WAG + 
Gamma model of amino acid substitution, and the 
NNI + SPR method of branch swapping. One thousand 



bootstrap replicates were calculated for each dataset 
under the same models and expressed as a percentage. 

Calculation of the PED between the highly conserved 
protein domains of ORFlab (3CL pro , RdRp and ZnHell) 
of the thirteen new isolates described in this study and 
those of five previously described mesoniviruses (CavV, 
HanaV, NDiV, NseV and MenoV) was performed using 
the ML method in the program TreePuzzle [21] and the 
WAG model of amino acid substitution. Sliding window 
analysis was used to calculate the mean amino acid 
divergences within and between the seven mesonivirus 
species described in this study. Percent divergences 
were calculated for each of the three largest ORFs in the 
genome (ORF lab, ORF2a, ORF2b) using the program 
SSE [22]. 

Additional files 



Additional file 1: Figure SI. Sliding window analysis of the pairwise 
amino acid distances within and between the seven putatively designated 
mesonivirus species for ORFlab (replicase proteins), 0RF2a (S) and 0RF2b (N). 

Additional file 2: Figure S2. A Clustal X multiple sequence alignment 
of mesonivirus pplab polyproteins illustrating region containing the block 
insertions (yellow shading) and various imperfectly repeated sequences that 
occur at the boundary and within the blocks of inserted sequence. 

Additional file 3: Figure S3. A Clustal X multiple sequence alignment 
of the polypeptides encoded in ORF2a (5 proteins) of the mesoniviruses. 
A predicted signal peptide in MenoV and predicted transmembrane 
domains in all mesoniviruses are shaded in aqua, predicted N-glycosylation 
sites are shaded in green, cysteine residues are shaded in yellow and the 
sites of proteolytic cleavage to generate glycoproteins SI and S2 and the 
unidentified N-terminal fragment are shaded in purple. 

Additional file 4: Figure S4. A Clustal X multiple alignment of the 
sequences of putative polypeptides encoded on ORF4 which occurs in 
the 3'-terminal regions of all mesoniviruses except MenoV. To emphasize 
the alignment, the KSaV 0RF4 protein has been shown to commence at the 
next available methionine residue located 34 amino acids downstream of 
the predicted initiation codon. 

Additional file S: Figure S5. A Clustal X multiple sequence alignment 
of region immediately downstream of the ORFla/ORFIb RFS that has 
been predicted previously to adopt a stem-loop structure in NDiV. The 
alignment illustrates sequence variations in nucleotides predicted in NDiV 
to be involved in base pairs that sustain the structure. 
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